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SEMI-AUTOMATIC INDEXING AND ENCODING 


INTRODUCTION 


Most mechanized information retrieval systems 
are dependent upon the extraction of inforrnation 
from documents, processing such information in 
some manner and finally placing this information 
in a file so that the information can be retrieved 
by machine. Any such retrieval system is only 
as good as the degree of accuracy and thorough- 
ness of the extraction of information from the 
document. 

Analysis can be defined as the process of ex- 
tracting information from a document. This in- 
formation is often in the form of notions or con- 
cepts. 

Indexing processes the information resulting 
from analysis by placing it in lists. 

Coding creates and assigns specific unambiguous 
terms as substitutes for such notions or concepts 
(13). The terms may come from the document it- 
self. 

Encoding prepares a document for storage by 
substitution of codes or terms for the information 
already extracted from the document. 

To date, in the development of most of the 
mechanized information retrieval systems, includ- 
ing the three mechanical retrieval systems de- 
veloped in the U. S. Patent Office (1,2,3) the major 
portion of the effort of analyzing and encoding docu- 
ments has been essentially manual. Only a small 
portion of this work has been mechanized. 

It is desirable to attempt to shift the major 
burden of the analyzing and encoding from humans 
to machines. This paper will describe and il- 
lustrate an experimental attempt to so do. This 
attempt suggests a possible interim solution to 
these problems. The extraction, listing, indexing 
and encoding of the structural formulae not specifi- 
cally named in the document are treated inanother 
manner not included in this paper. 

Four extremely important factors which have 
been considered are those of (1) personnel, (2) 
training, (3) product obtained and (4) cost. I have 
Called this “a semi-automatic indexing and en- 
coding method”. In this experiment humans are 
used for what they are best suited, e.g. reading 
and recognition of concepts, and machines for what 


they can do best, e.g. fast, accurate, and repetitive 
action. 


_ The usual two step procedure was used, consist- 
ing, first, of extracting the material from the docu- 
ment and, then, encoding it in a form suitable for 
making searches. The particular method used in 
the first step additionally resulted in a method for 
indexing. Such auxiliary indexing is of extreme 
importance in serving as a surrogate for thédocu- 
ment itself for the purpose of browsing in manual 


retrieval systems. The first step will, accordingly, 
be called indexing. 

An excellent start in automatic abstracting has 
been made by H. P. Luhn (4,5,6,7) and associates. 
However, these systems cannot now be used by 
the Patent Office. Even if the text of the docu- 
ments were available in “machine-readable” form 
and a computer such as the IBM 704 were availi- 
able, no program for performing the type of 
analysis we find necessary is now known. The 
ultimate solution awaits not only the development 
of machines able to read any text in any type 
font, but the discovery of those rules of language 
that would enable machines to process the text 
on the basis of its information content. Until such 
developments are forthcoming, humans will still 
play an important part in the task of analyzing and 
encoding documents. H. P. Luhn (8) and C. K. 
Schultz et al. (10) also suggest mechanized methods 
for indexing terms which they extract. 


INDEXING 


Indexing is the listing of words, subjects, or 
concepts found in, or otherwise associated with a 
document, which enables a person to find that 
document. Indexing in depth is an extremely use- 
ful tool for Patent Office use. Deep indexing by the 
usual manual methods is quite expensive and has 
required very highly trained personnel (11). In 
“subject” indexing particularly, extracted terms 
or concepts are placed under headings according to 
some specified scheme of classification. This 
requires personnel skilled both in the discipline 
involved and in the art of classification. 

Most indexes carry no indication of the depth of 
the indexing, since this factor is left to the judg- 
ment of the indexer. While he is always conscious 
of the time required to make the index, he is, per- 
haps, not as conscious of all the varied require- 
ments of the users. If a method of indexing is 
chosen which has definite rules, a less skilled 
indexer can be used. The uniform application of 
these rules would control the depth of indexing, and 
knowledge of them will indicate that depth. Our 
experimental method enables the user to know 
both the depth of indexing and the point of view 
from which the indexing was carried out. 

Indexes in which the user can tind the item de- 
sired either in the larevera of the document or 
under an appropriate subject heading appear de- 
sirable. 

The following steps were taken in preparing the 
index: 


1. Determination of the point of view or objective 
of indexing, e. g., extracting the chemical com- 
pounds from a document or chemical processes; or 
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in disciplines other thanChemistry extracting clec- 
trical components or systems. 

2. Determination of depth of indexing desired, 
e.g. extracting alf chemical compounds, all steroid 
compounds, only phosphate compounds or only 
electrical components. 

8. Determination of the rules for extracting 
terms or concepts, in accordance with the point of 
view and depth of indexing, is communicated to the 
indexer. The more precise and simple the rules 
are, the less training of personnel is needed, and 
the better the index. 

4. Conversion of extracted terms to machinable 
form, e.g. converted into punched cards. 

5. Elimination and counting of duplicates by 
machine processing. The elimination of synonyms 
is not done at this time. 

6. Printing of lists or indexes by machine. 


These first steps now result in a list of terms 
for each document. The machine can then arrange 
and print an alphabetical list of all the terms in all 
the documents indexed, again eliminating the dupli- 
cates, while indicating the plural documents in 
which each term appears. 

Experimental work has been carried on in the 
U. S. Patent Office since April, 1959. Several 
specific examples of this system will now be de- 
scribed. 


Steroid Example 


Ali “steroid compounds,” not shown by struc- 
tural formulae, are extracted from each patent of 
the 2,100 patents in the twelve steroid subclasses 
(1). 

The technical personnel were instructed to 
underline each word which recited a steroid com- 
pound ‘as ‘it occurred and every time it occurred. 
Thus a steroid compound occurring seven times 
was underlined seven times. Hence no effort or 
time was lost in determining whether a compound 
was previously underlined. Structural formulae 
were not underlined. The underlined patent was 
next given to a punch card operator who punched 
the underlined work in alphanumeric text. Onecard 
was punched for each steroid compound. Trailer 
cards were used as needed. The patent number 
in which the word occurred was also punched in 
each card. A RAMAC 305, by suitable program, 
accepted these cards, copied each card into its 
disc memory and then eliminated duplicate in- 
formation. Next it punched out a new deck of 
cards which eliminated duplicates and printed a 
list of steroid compounds in alphabetical order, 
containing no duplicates. Each card and its 
listing included the patent number and the number 
of times the term appeared in the patent. This list 
constitutes an index of all the steroid compounds 
in each patent, excluding those represented by 
structural formulae. These could be easily in- 
cluded if desired. The resulting list tends to be 
complete and accurate because of the way it is 
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generated. In most patents each steroid compound 
is usually repeated two or more times. Ifthe 
indexer failed to underline one occurrence, it is 
probable that the second or third occurrence would 
be underlined. Furthermore, since the list also 
contains the number of times the compound oc- 
curred, this may be an indication of its importance 
in the document (5,6). This list can be attached 
to the patent to aid in manual searching of the 
patent. It also enables one to quickly see ifa 
particular steroid compound is present in the 
patent. Specific examples of an underlined patent 
and such a prepared list can be seen in the ap- 
pendix. (A). A flow chart of the program is also 
appended. (B). Another RAMAC program takes 
the alphabetized deck of the entire file of docu- 
ments, and assigns address numbers of the RAMAC 
storage (9) for each different concept. 


Personnel 


In contrast to other methods, this method of in- 
dexing was done with personnel who have only a 
B. S. degree in chemistry. No other formal 
schooling appeared to be necessary. In fact, in 
two instances, personnel with only 1 year of 
chemistry were trained to do this work satis- 
factorily. However, a science degree is more de- 
sirable because it reduces the amount of training 
necessary. Highly trained indexers are notneeded 
because a knowledge of steroid chemistry is not 

ecessary. All that is required is the ability to 
recognize that a word describes a steroid com- 
pound. 


Training Time 


Relatively little time is required to train an in- 
dexer adequately to recognize steroid compounds, 
rarely more than a week. His training consists of 
supervised extraction and each of his errors are 
discussed in detail with him during the training 
period. 


CU 


Cost ‘ 


One of the attractive features of this system 1s 
low cost. The average trained extractor can 
underline thirty-two steroid patents a day. The 
ayerage salary is about $2 per hour. Thus this 
intellectual effort costs about fifty cents Per 
patent. Punching costs about $1.45 per patent. 
Machine processing on the RAMAC costs about 
cents a document. Additional steps in processing 
account for anadditional direct cost of $.25 a patent. 
Hence the average cost of the index per document 
is about $3.10. For this cost both the printed list 
or index and the punched cards, which can be used 
in encoding, are obtained. 


Products 


As previously stated, two products result 
rectly from this process. One ofthemis a pr! 


list of all the steroid compounds, excluding those 
disclosed only by a structural formula; the other 
ia a set of punched cards containing the steroids 
with no duplicates. : 

Many other lists and classifications can also be 
printed by the machine without returning to the 
original document. Thus, if one desired steroid 
compounds which are esters, or all those which 
have keto groups or which have molecular weight 
above 500, 2 highly skilled chemist can take such 
subject headings and, consulting the addresses of 


I level 


the terms extracted from the documents, list all 
such terms which belong under each heading. The 
machine can then print these listings by a slight 
modification of the program used in (9). 

Since the smallest unit of information, the com- 
pound, was extracted from thedocument, reference 
to the document for indexing, encoding, or separa- 
tion of the compounds into various categories or 
classifications can be handled without having to 
refer to the original document. Thus the chart 
below ilJustrates some of the possibilities. 


oxygenated steroids 


IL level ee 


keto compotinds pregnane hydroxy compounds halogen compounds 


— 
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Ill level Ia ee 
<<< 


3, 11-dihydroxypregnane 11-ketopregnene 3,20-dlketopregnane 1-hydroxy-3-chloropregnane 11-hydroxypregnene_ 


Level Ill is essentially the original index. Levels 
{ and Ii can be created in a manner similar to the 
subject matter headings by one skilled person. It 
is now easy to prepare other lists such as: 


keto steroids 


li-ketopregnene (Pat. #) 
3,20-diketopregnane (Pat. #) 


hydroxy steroids 


3,11-dihydroxypregnane (Pat. #) 
i-hydroxy-3-chloropregnane (Pat. #) 
1l-hydroxypregnene (Pat. #) 


halogen compounds 
1-hydroxy-3-chloropregnane (Pat. #) 
pregnane compound 


3,11-dihydroxypregnane (Pat. #) 

3,20-diketopregnane (Pat. #) 

ll-hydroxypregnane (Pat. #) 

1-hydroxy-3-chloropregnane (Pat. #) 
oxygenated compounds 


3,11-dihydroxypregnane (Pat. #) 
3,20-diketopregnane (Pat. #) 
11-hydroxypregnene 
1l-hydroxy-3-chloropregnane (Pat. #) 
il-ketopregnane (Pat. #) 

Thus it can be seen that almost any type of in- 
dex or subject heading list can be prepared without 
reference to the original document or without the 
need of having every extractor know theclassifica- 
tion scheme. ; 

A typical list of compounds generated by this 
method appears in the appendix (C). 


Resin Example 


The methods previously described can be used 
additionally to extract concepts or relationships. 
The following abbreviated example may be appli- 

le to the polymers for ultimate use in searching 
them (3,9). 


Determination of the point of view and the depth 
of indexing resulted in the following rule: 


Extract aff compounds in a patent which are 
catalysts, soivenis, monomers. In addition to the 
underlining, the personnel were instructed towrite 
adjacent thereto the number (1) for catalyst, (2) 
for solvent and (3) for monomer. If the number of 
kinds of different concepts or relationships is kept 
small, the extractors work satisfactorily. A spe- 
cific example is shown in the appendix (C). In- 
dexes can easily be printed for all catalysts, all 
solvents, all monomers, or all compounds in the 
patent regardless of their function. See appendix 


Phosphate Example 


There are seven people now extracting informa- 
tion by this method from the 1,765 phosphorus 
ester patents in Class 260 subclass 461. 

The personnel are instructed to underline all 
phosphorus compounds. This results in having 
both generic and specific compounds extracted. 
The resulting list is an index to all the phosphorus 
compounds in a patent, excluding structural formu- 
lae. See appendix A. A list can be produced con- 
taining all the phosphorus compounds in all the 
1,765 patents, and under each compound listing the 
patents in which they occur, for example: 


Trimethyl Tributyl Tricresyl 
phosphate phosphate phosphate 


Pat. No. Pat. No. Pat. No. 
Pat. No. Pat. No. Pat. No. 
Pat. No. Pat. No. Pat. No. 


No attempt is made to eliminate or correlate 
synonyms since the search system provides means 
to allow such redundancy. The index will be quite 
accurate because of the redundancy of extraction, 
important compounds usually appearing several 
times in a document. 


| 
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AUTOMATIC ENCODING 


From the punched cards so created the docu- 
ment can be automatically encoded. A detailed il- 
lustration of the coding of the steroids will now be 
described. Although not described in this paper, 
the phosphates and polymers can be semi-auto- 
matically or automatically encoded by using similar 
procedures and techniques. 

A slight modification of the first step (extracting) 
requires that the compound not only be underlined 
but that a virgule be placed to separate the various 


selected parts of the compound, for example: 
(appendix D) 
3,20-diketo/pregnane/, 


$-hydroxy/ll-keto/17-methyl/androstane/. 
i 4-pregnene/3,i1,20 trione/. 


The duplicates are eliminated from the punched 
cards as previously described. Before automatical- 
ly encoding, a dictionary must be generated. 


Dictionary Generation 


Two methods are available for generating a 
dictionary. In the first method one attempts to 
list ail the possible entries. There are two 
serious defects to this method, (1) itis impractical 
and time consuming to think of all the entries, and 
(2) some of the entries never used must still be 
stored at no small cost. The second method is to 
use the extracts frorn the documents in a suitable 
computer program to generate thedictionary. This 
Was experimentally and successfully tried with the 
steroids. Extracts from the first document were 
put into the machine, for example, 


3,20-diketo/pregnane/, 
3-hydroxy/pregnane/, 
1l-hydroxy/pregnene/. 


From these, the program generated the following 
terms which were alphabetized and listed to form 
a dictionary: 


diketo 
hydroxy 


pregnane 
pregnene 


The extracts from the next patent contained the 
following compounds: 


3-hydroxy/pregnenc/. 
3,20-diketo/pregnane/. 
l-hydroxy/pregnane/. 
3-hydroxy/7-chloro/pregnane/. 


The first three compounds were automatically 
processed since the dictionary already contained 
the terms: + 

diketo 


pregnane 
hydroxy ae 


pregnene 


aer™ 


and these compounds Contain only dictionary terms 
even though different from the compounds in the 
first patent. However, when the fourth compound 
is processed, the machine recognizes that there 
is no chtoro in the dictionary and it prints out 
on the typewriter “Chloro—not in File.” and 
automatically punches out a card which contains 
the whole compound. At the end of the run, all the 
terms Which the typewriter indicated “not in file*™ 
are added to the dictionary. Now the “not in file” 
compounds which were not processed are sent 


through and automaticaily encoded. The new 
dictionary will now contain 
chloro hydroxy pregnene 
diketo pregnane 


Starting with ten steroid terms, we have gene- 
rated a dictionary of about seven hundred terms. 
A small sample of the dictionary is shown in ap- 
pendix E, The dictionary terms increase about 
four terms per steroid patent. Theaveragesteroid 
patent contains about fifty compounds. It is hoped 
that the number of terms added will materially de- 
crease as the size of the dictionary increases. 
Trivial names such as “progesterone,” if listed, 
are, of course, added to the dictionary. The 
dictionary is therefore self generating and every 
term therein has been used at least once. (See 
appendix E.) 


Encoding 


The punched cards generated by the dictionary 
are fed into the machine. For example, for 


3,20-diketo/11-hydroxy/pregnenc/. 


the machine recognizes the first portion indicated 
by the / and looks up diketo in the dictionary. In 
the dictionary the codes for diketo are stored with 
the term, provisions for handling the numbers 
(e.g. 3, 20) having already been made. The codes 
are then sent to the output of the computer. The 
machine then recognizes and processes “11 hy- 
droxy” and finally “pregnene.” The period.” indi- 
cates the end of the compound. The codes which 
have been sent to output are then either printed 
if there is no suitable punch available or, if one is 
available, the codes are punched in a punchedcard. 
it does not matter how a compound is na 

in the document. It will always receive the same 
code. For example: 


3,20-diketo/ 4 4-pregnenc/. 
A4-pregnene/3,20-dione/. 
progesterone/. 


are ali different methods of expressing the oe 
compound, and will each receive the same codes. 
The machine in the first instance processes diketo 
and then pregnene; in the second instance i 
processes pregnene and dione; and, in the third it ; 


isa le Mindint tall 


rita eres 


merely puts out the code for progesterone. The 
dictionary can thus contain both easily recognized 
parts of compounds 4s well as their trivial names. 
The code for dione and diketo in the dictionary are 
the same. The code for progesterone, listedin the 
dictionary, is the same as the combined listings 
of (3,20-diketo) or (2,20-dione) and 4 4-pregnene. 
A flow chart of the program for encoding is ap- 
pendixed (F). 


The cards, or the listed codes are suitable for 
use in the steroid ‘searching system (1). 

With slightly modified procedures, the polymers 
and phosphates can likewise be analyzed and en- 
coded. 


CONCLUSIONS 


A procedure for analyzing and encoding docu- 
ments suitable for use in a mechanized search 
system has been described. Three types of indices 
are prepared: 


1. Index of terms, in the language of the docu- 
ment, alphabetically listed for each document 

2. Index of terms, in the language of the docu- 
ments, of all the documents in the file, alpha- 
betically arranged 

3. Index of terms under preassigned headings 


A procedure for generating a dictionary and en- 
coding compounds has been disclosed. The methods 
discussed appear to offer a promising resolution 
to the problem of file preparation. There are 
several apparent advantages of this method. These 
are that (a) the bulk of the extraction can be per- 
formed by the use of personnel having less formal 
education, (b) on-the-job training can be short, 
(c) an accurate and reliable file is constructed, and 
(d) cost of file preparation is reasonable. 
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Appendix A 


A partial listing 6f all the steroid compounds in patent 2,905,678. The two digit number is one less 
than the number of times the compound occurred in the patent. 


2905678 6 ACETATE 4 PREGNENE 00 
2905678 5A 6B DIHYDROXY 3 KETO PREGNANE 00 
2905678 5A 68 DIHYDROXY 3 KETO PREGNANES 00 

~ 2905678 5A 6B DIHYDROXY PREGNANE 00 
2905678 5A 68 DIHYDROXY PREGNANES 01 
«2905678 «5A 6B DIHYDROXY 3 11 20 TRIKETO PREGNANE 00 
«2905678 5A 6B DIHYDROXY 3 11 20 TRIKETO PREGNANE 6 ACETATE 00 
2905678 5A 6B DIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 00 
! 2905678 5A HYDROXY 6 ACETOXY PREGNANE 00 
2905678 5A HYDROXY 6 ACETOXY PREGNANES 00 
-- 2905678 += 5A _ HYDROXY 6 FORMOXY PREGNANE 00 
2905678 5A HYDROXY 68 ACETOXY PREGNANE 00 
2905678 5A HYDROXY 6B FORMOXY PREGNANE 00 
2905678 © 5A HYDROXY 6B FORMOXY PREGNANES i 01 
2905678 5A 6B 17A TETRAHYDROXY 3 11 20 TRIKETO PREGMANE. 6 FORMATE 21 ACETATE 00 
2905678 5A 6B 17A 21 TETRAHYDROXY 3 11 20 TRIKETO PREGNANE 21 ACETATE 02 
2905678 5A 6B 17A 21 TETRAHYDROXY 3 11 20 TRIKETOPREGNANE 21 ACETATE 01 
2905678 5A 6B 17A TRIHYDROXY 3 11 20 TRIKETO PREGNANE 00 
2905678 5A 68 17A TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 03 
2605678 5A 6B 21 TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 21 ACETATE 03 


2905678 5A 6B 21 TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 21 BENZOATE 02 
2905678 5A 6B 21 TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 21 BUTYRATE 
2905678 5A 6B 21 TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 21 CAPROATE 


2905678 5A 6B 21 TRIHYDROXY 3 11 20 TRIKETO PREGNANE 6 FORMATE 21 HEMISUCCINAT 
2905678 = 4E 


2905678 6B 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 21 SIACETATE 


2905678 6B 17A DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 


2905678 6B 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21. ACETATE 
2905678 6B 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21 BENZOATE 


2905678 68 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21 BUTYRATE 
2905678 6 


21. DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21 CAPROATE 
2905678 68 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21 ESTERS 


2905678 68 21 DIHYDROXY 3 11 20 TRIKETO 4 PREGNENE 6 FORMATE 21 HEMISUCCINATE 
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2905678 
2905678 
2905678 
2905678 
2905678 
2905678 
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TRIKETO 
TRIKETO 
TRIKETO 
TRIKETO 
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TRIKETO 
TRIKETO 
TRIKETO 
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PREGNENE 


PREGNENE 
PREGNENE 
PREGNENE 
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PREGNENE 
PREGNENE 
PREGNENE 
PREGNENE 
PREGNENE 
PREGNENE 
PREGNENE 
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6 FORMATE 
6 FORMATE 
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United States Patent Office 


2,879,190 
-PHOSPHONATE COMPOUNDS 
Bill E. Burgert and Henry Tolkmith, Midland, Mich., as- 


signors to The Dow Chemical Company, Midland, 
Mich., a corporation of Delaware 


No Drawing. Application July 3, 1957 
Serial No, 669,718 


§ Claims. (Cl. 260—461) 


The present invention relates to phosphonate com- 
pounds) having the formula 


X - 
N-CHr-CH:-N’ 
7 . 
Xx: Xs 
In this and succeeding formulas, X, represents X2 or hy- 
drogen and X, represents a radical having the formula 
Oo OH 
—CH:P 
OR 


wherein R is a lower alkyl radical. The expression “Jow- 
er alkyl” is employed in the present specification and 
claims to refer to the alkyl radicals containing from 1 
to 4 carbon atoms, inclusive. These new compounds are 
very viscous liquids which are readily soluble in water 
and somewhat soluble in many organic solvents. They 
have been found to be useful as parasiticides and are 
adapted to be employed as active toxic constituents of 
dust and liquid compositions for the control of many 
insect and fungal pests such as Alternaria solani, Fusari- 
um solani, Pythium spp. and Rhizoctonia solani. 

The new compounds may be prepared by causing a 
reaction between ethylenediamine, formaldehyde and a 


Gialkyi phosphite) of the formula 


OR 


Where it is desired to introduce two —CH,PO(OH) (OR) 
groups into the molecule, one molecular proportion of 
ethylenediamine is employed with two molecular propor- 
tions of formaldehyde and two molecular proportions of 
dialkviphosphite} Where it is desired to introduce four 
—CH,PO(OH)(OR) groups into the molecule, one mo- 
lecular proportion of ethylenediamine is employed with 
four molecular proportions of each of the formaldehyde 
and disikyl he reactants} Although the details of 
the reaction mechanism are not completely understood, 
the reaction takes place smoothly at the temperature 
range of from 0° to 50° C. with the formation of the de- 
sired product and alkanol of reaction. The reaction is 
somewhat exothermic and the temperature may be con- 
trolled by regulating the rate of contacting the reactants 
and by external cooling. Upon completion of the reac- 
tion, the reaction mixture is fractionally distilled under 
reduced Pressure to separate low boiling constituents and 
obtain the desired product as a liquid residue. 

la carrying out the reaction, a mixture of ethylenedi- 
amine and dialkyl phosphitefis contacted portionwise with 
an aqueous solution of formaldehyde. This operation is 
carried out with stirring and at a temperature of from 
0° to 50° C. Upon completion of the reaction, the re- 
action mixture is fractionally distilled under reduced 
Pressure at gradually increasing temperatures up to a 
temperature of 150° C. to separate low boiling constituents 


2,870,190 
Patented Jan. 20, 1959 


2 
and obtain the desired product as a liquid residue. Since 
the desired products are somewhat unstable at tempera- 
tures in excess of 150° C., exposure to such elevated tem- 
eratures for any appreciable period should be avoided. 
5 The following examples illustrate the invention but are 
not to be construed as limiting: 


Example 1.—Exbyle U ’=bis(Q-eth 
amethanephosphonate)| 
10 Res 
CH;—-NH—CH;—P 
OC3Hs 
O OH 
| £ 7 
15 CH;—NH—CH;—P 
OC:Hs 


A 37 percent aqueous solution of formaldehyde con- 
taining 16.5 grams (0.55 mole) of formaldehyde was 
added portionwise with stirring to a mixture of 15 grams 
(0.25 mole) of ethylenediamine and 75.9 grams (0.55 
mole) of O,O-diethylphosphite.} The addition was car- 
ried out in 15 minutes and at a temperature of from 25° 
to 35° C. Following the addition, stirring was continued 
for 40 minutes, and the reaction mixture thereafter diluted 
with 70 grams of methylcyclohexane. The resulting mix- 
ture was fractionally distilled under reduced pressure at 
temperatures gradually increasing up to 150° C. to re- 
move low boiling constituents and obtain an ethylenedia- 
mine N,N’-bis(O-ethylmethanephosphonate) product] as 
a dark mn, very viscous, liquid residue having a den- 
sity of 1.4 at 25° C. This product was found to contain 
9.27 percent nitrogen, 20.50 percent phosphorus and 7.09 
percent hydrogen compared to the calculated values 9.21, 
20.40 and 7.29 percent, respectively. 


Example 2.—Ethylenediamine N,N,N',N’-tetra (O-ethyl 
—_methanephosphonate 


O OC:Hs 

CHr—N: CH 17 

40 Said aN 
OH a 


0 OCiHY 
7 
SHy—-N=| CHr-F 
45 on /s 


A 37 percent aqueous solution of formaldehyde con- 
taining 24 grams (0.80 mole) of formaldehyde was added 
portionwise with stirring to a mixture of 9 grams (0.15 
mole) of ethylenediamine and 109.1 grams (0.80 mole) 

5° of O,O-diethyl phosphite.) The addition was carried out 
in Poqainules Gad BT a Temperature of from 20° to 30° 
C. Following the addition, stirring was continued for 
90 minutes to complete the reaction. The reaction mix- 
ture was then processed as described in Example 1 to 
55 obtain an ethylenediamine N,N,N’,N’ - tetra(O - ethyl 
methanephosphonate) product] as a light tan, viscous, 
liquid residue having a density of 1.2 at 25° C. This 
product contained 5.49 percent nitrogen and 20.86 per- 


cent phosphorus compared to the theoretical values of 
GO 5.11 percent and 22.16 percent, respectively. 


Example 3—Ethylenediamine N,N’-bis(O-isopropy! 
methanephosphonate) ] 


25 


9 on 

85 GH:-NH—OHr-F. cH 
oof 

0 OH ‘CH: 

7 

CH;-NH—CH>-P Cth 
70 Sp asS 
Ss 


i= 


— 


The following patent and list is an example of the underlining and listing in a phosphate patent. 


2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 
2870190 


2870190 


2870190 


DIALKYL PHOSPHITE 


DIALKYLPHOSPHITE 


DIALKYL: PHOSPHITE 


ETHYLENEDIAMINE 
ETHYLENEDI AMINE 
ETHYLENCOLAMINE 
ETHYLENEDIAMINE 
eETHYLENEDIAMINI 
ETHYLENEDI AMINE 
ETHYLENEDIAMINE 
ETHYLENCDI AMINE 
ETHYLENEDIAMINE 
ETHYLENEDI AMINE 
ETHYLENEDIANIN] 
ETHYLENEDI AMINE 
ETHYLENEDIAMINE 


ETHYLENEDIAMINE 


N 


N 


REACTANTS 
N& BIS-G ETHYL METHANEPHOSPHONATE- 
N& BIS-O ETHYLMETHANEPHOSPHONATE 


N N& N& TETRA -O ETHYLMETHANEPHOSPHONATE 


O71 


00 


00 


00 


00 


00 


N NG N& TETRA-O ETHYLHETHANEPHOSPHONATE- PRODUC: 00 


N& BIS-O ISOPROPYL METHANEPHOSPHONATE 


N N& NE TETRA-O NORMALBUTYL METHANEPHOSPHONATE 
METHYLPHOSPHONATES 0 
N& BIS-O METHYL METHANEPHOSPHONATE 

N& BIS-O 1SOBUTYL METHANEPHUSPHONATE 

N N& N& TETRA-O METHYL. HETHANEPHOSPHONATE- 
N N& NE TETRA-O NORMALPROPYL te THANEPHOSPHONATE 
N N& NE TETRA-O ETHYL METHANEPHOSPHONATE 

N& BIS-O ETHYL NETHANEPHOSPHONATE 


N& BIS-O ISOPROPYL HETHANEPHOSPHONATE 


O 0 DIETHYL PHOSPHITE 


0 0 
00 


vO 


DINORMALBUTYL PHOSPHIJE 
OIMETHYL PHOSPHITE 


OITSOBUTYL PHOSPHITE 


O DiNORMALPROPYL PHUSPHITE 


ie) 
00 DI 
(6) 


-LOWER ALKYL- 


PHOSPHITE 


O DIISOPRUPYL PHOSPHITE 


PHOSPHONATE COMPOUNDS 


PHOSPHONATE COMPOUND 


00 


00 


00 


Appendix B 


The following ds the flow chart used on the RAMAC 305 for eliminating the duplicates. 
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The following flow chart illustrates the letters A, B, C, D, E, and F as examples. 
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Appendix C 


This appendix contains a copy of the underlined patent followed by a list of all compounds which are 
solvents, monomers, and catalysts. The 2-digit number at extreme right indicates one less than the 


number of times the compound occurs. 


Patented Nov. 16, 1943 2,334,195 
UNITED STATES PATENT OFFICE 


2,334,195 


PROCESS FOR THE PRODUCTION OF POLY- 
MERIZATION PRODUCTS 


Heinrich Hiopff and Siebert Goebel, Ludwigs- 
hafen-on-the-Rhine, Germany, asaignors, by 
meane assignments, to General Aniline & Film 
Corporation, New York, N. ¥., 3 corporation pf 
Deleware 


No Drawing. Application March 6, 1940, Serial 
No. 322,547. In Germany March 14, 1939 


5 Claims. (CL 260—80) 


in conjunction with polymerizable substances are 
for example maleic acid diethyl ester and di- 
methyl ester. By varying the proportions of the 
components employed, the properties of the poly- 
merization products may be varied considerably. 

The polymerization products are suitable for a 
great variety of applications. They may be 
worked as plastics by injection-moulding or used 
for coating electrical conductors or for the man- 
ufacture of cgndensers. When dissolved in _or- 
ganic solvents they may be used for the prepara- 
tion of coatings resistant to acids and alkalies. 

The following examples serve to illustrate how 
our present invention may be carried out in 
Practice, but the invention is not restricted to the 
said examples. The parts are by weight. 


Ezrample 1 


The present invention relates to @ process for 
the production of polymerization products. 

The copending application Ser. No. 309,238, 
filed December 14, 1939, by the present inven- 
tors and Curt W. Rautentrauch, describes a proc- 5 
ess for the production of polymerization prod- 
ucts In which ethylene? ff desired in admixture 
with other polymerizable compounds, is subjected 
to polymerization in an aqueous emulsion in the 
presence of substances supplying oxyeen! 10 

We have now found that _ethylene;may be con- 
verted into solid or semi-solid polymerization. 
products in a very simple manner by effecting its 
polymerization in the presence of organic sol- 
yents*and oxygen/or substances suppl = 16 
gen./ The organic solvents may be used either 
alone or in combination with aqueous liquids in 


which the _solvents"and the_ethylene’ are emuls!- 4 
fled or dissolved. By our new method of working 109 parts of liquid ethylene; 100 parts of ben- 


the solid polymerization products may be ob- 99 Zne°and 2 parts of benzoyl peroxide’ are heat- 
tained under relatively low superatmospheric ed in a pressure vessel at from 80 to 100° C, 
pressures. Comparatively small proportions of After a short time the initial pressure of 100 
_organic solvents*$ay from 10 to 100 per cent (per- Atmospheres falls to about 70 atmospheres, Aft- 
centage by weight with reference toethylenePare &F removing any unpolymerized ethylene’and dis- 
sufficient for obtaining good yields of solid poly- og tilling off the solvent; the polymerization prod- 
merization products at between about 100 and uct is left behind in the form of a white waxy 
about 300 atmospheres, whereas considerably mass melting above 100° which may be purified 
higher pressures are required when conducting by redissolving irom grganic solvents, 
the process in the absence of_organic solventsi When using toluene’instead of_benzeng™ & val- 
Among the substances having an accelerating 30 uable polymerization product is likewise obtained. 


effect in the polymerization we may mention oxy- =f trichlorethylene'is employed instead of ben- 


Z : interpolymerization product containing 
gen! or substances supplying oxygen/ especially  Zene-an 
peroxidic substances/ for example persulphates,) Chlorine ts obtained having similar properties. 
percarbonstes, perborates as well as benzoyl-\ | 


szrample 2 
roxide, pe: peroxide, tolule } 
aeiiooet oe Acid, diacetyiperoxtde, Salle 35 259 parts of_benzene’and 500 parts of ethylene? 


tainable from the acid chlorides by means of hy- re emulsified in a pressure vessel in 1000 parts 

drogen peroxide and caustic soda solution. of an aqueous solution containing 20 parts of the 

As suitable organic solyents-we may mention Sodium salt of alpha-hydroxyoctodecane sul-) 
phonic acid, 1.5 parts of 


by way of example benzene, toluene, xylene and 
halogenated hydrocarbons, such as ethylene ehlo-{.2» 7.8 parts of hydroxen peroxigeland 10 parts of 
Hide, carbon tetrachloride and chlorbenzene. Benzoyt peroxide and polymerized under a Pres 
Interpolymerization products may be produced Sure of 150 atmospheres at between 100° and 
ae p Sarre 110° C. After 2 hours the emulsion Is precipi- 


by treating ethylene together with further unsat- 
b “ n ee in their turn are capable 45 tated by the addition of an electrolyte, such as_ 


of polymerization under the conditions employed, 
i, e. substances capable of polymerizing alone 
under the conditions concerned and substances 
which polymerize only when exposed to the poly- 


merizing conditions in conjunction with sub- 60 


stances polymerizable by themselves. Among 


common salt, A white powder is obtained hay- 
ing a melting point of above 100° C. 


izrample 3 


100 parts of e lene, 200 parts of “methanol” 
and 2 parts of i Jare mixed in a 
pressure resisting vessel provided with a stir- 


substances polymerizable by themselves we may : 
mention by way of example propylene, isobu-) , ‘ings device and heated to from 110° to 120° C. 
Say) Py -Propyiené,- > for 2 hours. The pressure, being 80 atmospheres 


tylene, butadiene, styrene and Se 
compounds not polymerizable by themselves but’ gs at the outset, rises to 160 atmospheres and falls 
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a 


ACRYLIC ACID ESTERS-3 
ACRYLIC ACID ETHYL ESTER-3 

BENZOYL PEROXIDE-1 

BENZENE~2 

BUTADIENE-3 

BENZOYLPEROXIDE-| 

BUTANOL-2 

CARBON 1ETRACHLOR 1DE-2 

- CHLORBENZENE-2 

CHLORINATED ALIPHATIC HYDROCARBON-2 
_ DIACETYLPEROX IVE-1 

DICHLORETHYLENE-2 

ETHYLENE-3 

ETHYLENE CHLORIDE-2 

-HALOGENATED HYDROCARBONS-2 

HYDROGEN PEROXIDE-| 


Pian 


_ ISOBUTYLENE-3 


. OLEIC PEROXIDE-1 
ORGANIC SOLVENi-2 

5  PEROXIDIC SUBSTANCES-~1 
 PERSULPHATES-1 


0 SSIUM PERSULPHATE-1 


2334195 TRICHLURETHYLENE-2 00 


243195 UNSATURATED SUBSTANCES-35 06 
2334195 UNSATURATED COMPOUNU-4 (16) 
23354195 XYLENe-2 00 


The following is a list ofallofthe catalysts in 2,334, 195. The 2-digit number plus one at the extreme 
right is the number of times the compound occurs in the patent. 


ral 


: 2354195  BENZOYL PEROXIDE-1 05 
2334195 BENZOYLPEROXIDVE-! OG 
2554195 . VIACETYLPERUXIDE-1 06 
2354195 HYDROGEN PEKOXIDE-1 01 
; 2354195 OXYGEN-] 03 
2334195 OLEIC PERUXIDE-1 : 0} 
2334195 PEROXIDIC SUBSTANCES-1 00 
| 2334195  PERSULPHAIES-1 00 
2334195 PERCARBUNA1ES~1 00 
2534195. PERBORATES-1 : 00 
+ 2354195 PERACETIC ACID-1 00 
| 2334195 POTASSIUM PERSULPHATE-! ot 
2334195  PEROXIDE-1 : 00 
2534195 SUBSTANCES SUPPLYING UXYGEN<1 O& 
2334195 = TOLUIC ACID PEROXIDE-1 vo 
The folowing is a list of all solvents in 2, 334, 195: 
2330195  BENZENE-2 OS 
2334195  bUTANOL-2 v1 
e 2334195  CAKBON TETRACHLURIDE-2 00 
2334195 CHLORBENZENE-2 v0 
| 2334195 CHLORINATED ALIPHATIC HYDROCARBON-2 00 
© 2334195  DICHLORETHYLENE-2 a9 
2334195 ETHYLENE CHLURIVE-2 ay 
2334195 HALOGENATED nYIROCARSUNS-2 90 
2334195  METHANOL=2 we 
) 2334195 ORGANIC SuLventS$-2 05 
2334195  URGANIC SULVENT-2 vl 


2534195 SOLVENTS-2 © ay 
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2334195 
2334195 
233u195 


23534195 


The following is a list of all the momomers in 2, 334, 195: 


2338195 
255K 195 
2334195 
2552195 
2354195 
23346195 
2358195 
2534195 


2334195 


SGLVENT-2 
TOLUENE-2 
TRICHLORETHYLENE-2 


XYLENE-2 


ACRYLIC ACIO ecaTeRS-3 


ACRYLIC ACID cTHYL ESTER-5 


BUTADIENE-5 

clTHYLENe-3 

ISUBUTYLENE-35 
PROPYLENE-3 

STYRENE-3 

UNSATURATED SUBSTANUES-3 


UNSATURATED CUMPOUND-3 
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Appendix D 


A copy of a steroid patent underlined for automatic encoding and a list of the compounds extracted 
therefrom. ‘Two digit number is one less than the number of times the compound occurs in the patent. 


United States Patent Office 


i 


2,871,246 


PROCESS FOR 3-14 


K a = 
_NADIEN|29-0% ; Sai u 


BJjarte Liken, Roosevelt, Puerto Rice, assignor te G. D. 
Searle & Co., Chicago, HL, a corporation of Delaware 


No Drawing. Application May 28, 1958 
Serial No. 738,395 


5 Claints. (Cl. 260—397.4) 


10 


The instant invention relates to the preparation 


of 3g- 15 
hydroxy{S-ailkylf5 16-prenadien}20-ones and their ra 


of the general structural formula 

ony 
yee 
c=0 
CH; I 


| ie 


| 25_alkyl-5,16-pregnadi 


“ 


In the foregoing structural formula the lower alkyl radi- 
oul in the 6-position can be a straight-chain or branched 
propyl, butyl, amyl or hexyl group, but it is preferably a 
methyl or ethyl radical. The radical R can represent 35 
hydrogen or an acyl radical of a hydrocarbon carboxylic 
acid, preferably of less than 7 carbon atoms. Examples 
of such acyl groups are formyl, acetyl, propionyl, butyryl, 
valeryl and hexanoy]. 

The foregoing compounds are progestationally active. 49 
They also serve as starting materials for the preparation 


N 
S 

y 

(ower alkyl) 


of _6-alkylprogesterones|and_6-alkylj17«-_hydroxyproges- 
esjand their 17-esters|which are potent progestational 


agents. 

The procedures in the literature for conversion of dios 45 
-genin|or cholesterol) to pregnadiene/ derivatives all suffer 
from serious disadvantages in specific steps of the reac- 
tion sequence. Actually only the first conventional step 


of isomerizing a_6-alkykliosgenin| to its, corresponding 
pseudoderivative, 38,26-diacetoxy{6-alkyl}5,20(22)-furo- 50 
stadieneJcan be conducted by a procedure analogous to 


that generally employed for diosgenin] namely reaction 
with acetic anhydride for 6 hours at 190-195° C. The 
subsequent steps of the sequence had to be altered mate- 
tially and in some steps, as in cleavage of the 5-acetoxy-y- 55 
methylpentanoyl ester, entirely different procedures had 
to be devised. 

Thus, the second step, namely oxidation of the furo- 

i to the 16-(3-acetoxy-y-methy]) pentanoa y 
chromium trioxide in. an acetic acid medium is advan- 60 
tageously effected in the presence of a chloralkane sol- 
vent amounting to 15-60% by volume of the oxidation 
medium. The chlorinated solvents contemplated are the 
chlorinated hydrocarbons of one to three carbon atoms 
including, for example, ethylene chloride, chloroform, (5 
carbon tetrachloride, ethylene trichloride and propylene 
chloride. Effecting the oxidation in this manner facili- 
tates recovery of the steroid from the reaction mixture. 
Thus the chromium salts and acetic acid are readily re- 


2,871,246 
Patented Jan. 27, 1959 
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can then be recovered by evaporating the chlorinated hy- 
drocarbon, 

The subsequent cleavage iv effected by a novel solvo- 
lyals with at least three equivalents of alkali in aqueous 
acetone, A rapid reaction occurs with very strong alkali, 
@. g. caustic soda and potash. More time is needed with 
weaker bases such as the alkali meta! carbonates. The 
acetone-water ratio can vary from about 1:3t03:1. The 
alkali, ¢, g. sodium hydroxide, can constitute from 1-209 
by weight of the alkaline medium, The time period for 
the treatment is relatively short, ranging from about S-60 
minutes. Treatment temperatures are in the range of 20° 
C. to about 65° C. Temperature is relatively critical 
because two different results are attainable. In the lower 


range of about 20-33° C., cleavage of the substituted pen- 
tanoate results in a high yield of 38-a¢ m-alkvilS 16- 


ien}20-one] On the other hand, in the range of 


about 45-65° C, there also occurs saponification of the 
3-acylate to yield. Sechydroxwl @-alkvll$ 16-preanedien{20- 
0 


It is within the contemplation of the instant inven- 
tion to carry out the same alkaline acetone-water treat- 
ment further at higher temp 
acyloxy ° ; 


hese compounds are valuable as key intermediates in 
the overall conversion of diosgenin| into medically active 
6~alkyl steroids. The 38-hydroxy{6-alkyl}5,16-pres 


2 s|have certain advantages over thelr >-esters 
30 as intermediates, Epoxidation to _38-hydroxy}6-alkyl- 
q -preg: 2 s} can be effected by re- 


: 
action with alkaline hydrogen peroxide in methanol solu- 
tion to give a better yield. Also the 3-hydroxy com- 
pounds can be converted by the Oppenauer oxidation to 
the_6-alkyl}5,16-pregnadien}3,20-diones} then by Raney 
nickel reduction to progestationally active G-alkylproges- 


_terones/ or epoxidation as above outlined to the 16«,17a- 


epoxides, followed by conversion to the 17<-hydroxy com- 
pounds according to conventional practices. The resuli- 
ing esters of 17-hydroxyprogesterone{ are highly potent 
progestational agents. It has also been found that the 
3-hydroxy-6-alkyl-5,16-pregnadien-20-onesj are progesta- 
tional agents. 

The following examples describe the invention in fur- 
ther detail but are not to be construed as limiting in spirit 
or in scope. Quantities are given in parts by weight. 


Example I 


To an ice-cooled solution of 100 parts of diosgeninf 
in 1340 parts of dichloromethane there is added a solu- 
tion of 160 parts of acetone, 270 parts of dichloro- 
methane, 2 parts of sodium acetate and 75 parts of peroxy- 
acetic acid. After completion of the addition, the mix- 
ture is left at room temperature for 5 hours and then 
washed with 1000 parts of a 5% ferrous sulfate solu- 
tion which serves to reduce the excess peroxyuc acid 
The organic layer is separated and caretully washed with 
water. The washings are reextracted with dichloro- 
methane and the combined dichloromethane extracts are 
taken to dryness on a steam bath. ‘hen 900 parts of 
heptane are added and a small head fraction is distilled 
off in order to remove residual dichloromethane. The 
mixture is cooled to 30° C, under agitation and the 
crystalline material is collected on a filter. This materia! 
consists of a mixture of the a- and S-epoxides; the optical 
rotation ap of —120° in chloroform solution indicates 
predominance of the a-epoxide, The mother liquor i 
evaporated to a small volume and 4 second crop is 
harvested with a rotation ap of —10?*, which is also 
a mixture of the two epimeric epoxides, but in this crop 
the A-cpoxide predominates. The first crop ts dissolved 
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2871246 6 ALKYL/PROGESTERONES/» 


ol 

2871246 & ALKYL/17A/17HYDROXY/PROGESTERONES/ 6 ; 00 
2871246 6 ALKYL/DIOSGENIN/« ' 00 
2871246 3 ACYLOXY/16-D ACETOXY G METHYLPENTANOYLOXY=/5 PREGNEN/200NE/o ol 
2871246 6 ALKYL/S 1L&PREGNADIEN/3 20DIONES/e 00 
2871246 6 A/& METHYL/16A/17A/1617EPOXY/4 PREGNENE/3 20DIONE/o 00 
2871246 6 A/& METHYL/17A/17HYDROXY/PROGESTERONE/e 00 
2871246 3 ACYLATES/3 8/3 HYDROXY/6 METHYL/5 16PRESNADIEN/200NES/e 00 
2871246 3 ACYLOXY/26ACETOXY/6 METHYL/5 20-22-FUROSTADIENE/e 00 
2871246 3 ACETOXY/& METHYL/16-D ACETOXY G METHYLPENTANOYLOXY-/5 PREGNEN/200NES 00 
2871246 s/e 00 
2871246 3 B/3 HYDROXY/6 ALKYL/5 18PREGNADIEN/200NES/e ol 
2871246 3 B/3 26DIACETOXY/6 ALKYL/5 20~22-FUROSTADIENE/e 00 
2871246 3 8/3 ACYLOXY/6 ALKYL/5 16PREGNADIEN/200NE/o ol 

2871246 3 B/3 HYDROXY/6 ALKYL/5 16PREGNADIEN/200NE/e 00 

2871246 3 B/3 HYDROXY/6 ALKYL/16A/17A/1617EPOXY/5 PREGNEN/200NES/o 00 

2871246 3 8/3 HYDROXY/5 A/6 A/5 6 EPOXY/22A/22ALLOSPIROSTANE/e 00 

2871246 6 B/6 METHYL/22A/22ALLOSPIROSTANE/3 B/5 A/3 5 DIOL/e 02 

2871246 3 B/3 ACETOXY/6 RB/6 METHYL/22A/22ALLOSPIROSTAN/5 A/5 Ol/o 03 

2871246 6 8/6 METHYL/22A/22SPIROSTANE/3 B/5 A/3 5 DIOL/e ope 


2871246 2 B/3 ACETOXY/16B/16-D ACETOXY G METHYLPENTANOYLOXY-/6 METHYL/5 PREGNE 01 


2871246  sN/200NE/e ol 
2871246 3 B/3 HYDROXY/6 METHYL/5 16PREGNADIEN/200NE/6 05 
2871246 3 B/3 ACETOXY/5 A/6 A/5 6 EPOXY/22A/22SPRIOSTANE/«o 00 
2871246 6 B/6 STHYL/22A/22ALLOSPIROSTANE/3 B/5 A/3 5 DIOL/e 00 
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Appendix F 


*& flow chart of a program for automatic encoding. 
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PRoERAN *L 


WovseKeeriae 
Routine 


The following fJow chart shows examples of the alphabetic routines for letters A-P: 
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